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Abstract —We study a tandem of agents who make decisions 
about an underlying binary hypothesis, where the distribution 
of the agent observations under each hypothesis comes from 
an uncertainty class. We investigate both decentralized detection 
rules, where agents collaborate to minimize the error probability 
of the final agent, and social learning rules, where each agent 
minimizes its own local minimax error probability. We then 
extend our results to the infinite tandem network, and derive 
necessary and sufficient conditions on the uncertainty classes for 
the minlmax error probability to converge to zero when agents 
know their positions in the tandem. On the other hand, when 
agents do not know their positions in the network, we study 
the cases where agents collaborate to minimize the asymptotic 
minimax error probability, and where agents seek to minimize 
their worst-case minimax error probability (over all possible 
positions in the tandem). We show that asymptotic learning of 
the true hypothesis is no longer possible in these cases, and derive 
characterizations for the mlnimax error performance. 

Index Terms —Social learning, decentralized detection, tandem 
networks, robust hypothesis testing 

I. Introduction 

In this paper, we formulate and study the robust social 
learning problem in a tandem network. A tandem network 
consists of agents connected in a serial fashion, where each 
agent receives an opinion about a binary hypothesis from a 
previous agent, and makes a decision about a binary hypothesis 
based on the previous agent’s opinion and its own observation. 
Despite the simple structure of the tandem network, studying 
it can lead to insights about more complicated network struc¬ 
tures such as those in social networks or Internet of Things 
(loT) networks. The tandem network approximates a single 
information flow in a network, and it and its variants have 
been widely studied in [1]-[10]. 

In our model, each agent’s decision is based on a local error 
criterion, which it selfishly tries to optimize. This behavior is 
present in social networks, where users are mainly concerned 
with spreading only locally accurate information. In this paper, 
we call this social learning [9]-[17]. One such application of 
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social learning is in the case of participatory sensing, where 
inference about a phenomenon of interest is made through the 
help of agents in the network [11], [18], [19]. For example, 
this can occur when users send a picture of litter in a park to 
a social sensing platform [20], [21] or report congested road 
conditions. [22]. 

On the other hand, if the agents’ decision rules are designed 
to minimize the error criterion of the last agent in the network, 
or the asymptotic error probability in the case of an infinite 
network, this is known as decentralized detection [8], [23], 
[24]. One major application of decentralized detection is in 
sensor networks with a fusion center [25]-[27]. If the fusion 
center is able to relay information to the other agents, it will 
be able to select a set of globally optimal decision rules for 
every agent. However, many practical networks, such as social 
networks, do not have a fusion center. Furthermore, even for 
networks with fusion centers, the fusion center may not be 
able to easily communicate with the other agents. This is true 
of the participatory sensing examples above. 

In the above examples, it is assumed that each agent knows 
the distribution of its private observation, and that of its 
predecessor, as well as its position in the network. However, 
in a real-life network, this is generally not the case. In this 
paper, we investigate what happens when one or both of these 
assumptions do not hold. 

A. Related Work 

Binary hypothesis testing in a tandem network model is 
studied in [3], [5], which shows that learning the true hy¬ 
pothesis asymptotically is possible with unbounded likelihood 
ratios, and not possible with bounded likelihood ratios when 
agents transmit only 1-bit messages. Decentralized detection 
policies for tandem networks are also considered in [6], and 
conditions for the error probability approaching zero as the 
number of agents grows are derived. This is a network where 
each agent after the first receives exactly one decision from 
its predecessor. The authors also study a sub-optimal scheme 
where each sensor “selfishly” tries to minimize its own error, 
as opposed to the error of the root agent. The reference [8] 
shows that the rate of error decay is at most sub-exponential. 

Feedforward networks, in which an agent obtains informa¬ 
tion from a subset of previous agents not necessarily just 
the immediate predecessor, have been studied in [9], [10], 
[17]. In [10], agents are able to access the decisions of their 
K most recent predecessors. It is demonstrated that almost 
sure learning is impossible for any value of K, and learning 
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in probability is possible for K > 2. A new model where 
forward looking agents try to maximize the discounted sum 
of the probability of a right decision is also considered. The 
reference [9] studies the decentralized detection problem in a 
game theoretic setting, and examines the effect of obtaining in¬ 
formation from different sets of previous agents on the rate of 
error decay. The reference [28] examines the asymptotic error 
rate of feedforward topologies under two types of broadcast 
errors, namely erasure and random flipping. 

All the above works assume that agent’s observations are 
drawn from known distributions under each hypothesis. This 
assumption may not hold in practical networks like loT net¬ 
works, in which sensors’ observation distributions may change 
over time, or in social networks, in which agents’ observations 
may be affected by the agents’ mood at a particular time. The 
robust detection framework was first proposed by [29] for a 
single agent to model the case where the observation distribu¬ 
tions are not known exactly. A survey of results in this area 
can be found in [30]. The underlying probability distributions 
governing the agent observations are assumed to belong to 
different uncertainty classes under different hypotheses, and 
it is shown that under a minimax error criterion, the optimal 
decision rule for the agent is a likelihood ratio test based on 
the pair of least favorable distributions (LFDs). Subsequently, 
the work [31] investigates robust detection in a finite parallel 
configuration, with and without a fusion center. The problem 
of robust sequential detection is studied in [32]. Robust social 
learning however has not been addressed in these works. In 
addition, robust detection and learning have not been studied 
for the tandem network. 

B. Our Contributions 

In this paper, we consider robust binary hypothesis detection 
and social learning in a tandem network in which the obser¬ 
vation models of agents under each hypothesis are uncertainty 
classes of probability distributions. Our main goals are to 
obtain the optimal agent policies under a minimax error cri¬ 
terion, under both decentralized detection and social learning 
frameworks, and characterize the asymptotic minimax error 
probabilities under various scenarios. Our main contributions 
are the following; 

1) For the tandem network, we show that the solutions 
to the robust decentralized detection and social learning 
problems are equivalent to the respective solutions of the 
corresponding classical hypothesis testing problem where 
all the private observations are distributed according to 
the LFDs (Theorem 1). Our proof can be extended to 
general tree topologies, which generalizes a result in [31], 
where a parallel topology is considered. 

2) We show that when the uncertainty classes for all agent 
observations are the same, and agents know their po¬ 
sitions in the tandem, asymptotically learning the true 
hypothesis under both decentralized detection and social 
learning frameworks is not possible if the contamination 
of both uncertainty classes are non-zero, even when the 
log likelihood ratio of the nominal distributions is un¬ 
bounded (Theorem 3 and Theorem 4). This is in contrast 


to the case where the contamination of the uncertainty 
classes are zero [6], [8], in which case asymptotic learn¬ 
ing happens if the log likelihood ratio is unbounded. 

3) When agents know their positions in the network, we 
show that asymptotically learning the true hypothesis 
under social learning is achievable if and only if the log 
likelihood ratio of the nominal distributions is unbounded, 
and there are two subsequences of agents, one corre¬ 
sponding to each hypothesis, such that the contamination 
of the uncertainty class under that hypothesis converges 
to zero (Theorem 5). 

4) When agents do not know their positions in the tandem, 
we show that it is not possible to asymptotically learn the 
true hypothesis. We investigate the cases where agents 
collaborate to minimize the asymptotic minimax error 
probability, and where agents seek to minimize their 
worst-case minimax error probability (over all possible 
positions in the tandem), and characterize the minimax 
error performance in these approaches (Theorems 6 and 

7). 

In this paper, we consider only robust decentralized detec¬ 
tion and social learning in tandem networks, which are very 
simple in structure. Social networks and loT networks are 
much more complex in practice. Therefore, our results are 
limited, and can only be applied heuristically to more practical 
networks. Our analysis forms the foundation for studying more 
complex networks like trees and general loopy graphs, and 
provides insights into designing optimal decision rules for such 
networks. For example, although using likelihood ratio tests 
based on LFDs at each agent is not known to be optimal for 
loopy graphs, we expect this to produce reasonable results 
in practice. Addressing the performance of social learning in 
more complex networks is part of future research. 

The rest of this paper is organized as follows. In Section 
II, we introduce the robust decentralized detection and social 
learning problem in a tandem network. In Section III, we 
provide a characterization for the agents’ optimal decision 
rules in both decentralized detection and social learning in 
a tandem network. We then derive necessary and sufficient 
conditions for asymptotically learning the true hypothesis 
under various simplifications in Section IV. We also study the 
case where agents do not know their positions in the tandem 
in this section. In Section V, we illustrate some of our results 
using a numerical example. Lastly, we conclude in Section VI. 

11. Problem Formulation 

We consider a tandem network consisting of N agents, with 
agent I being the first agent and N being the last (see Figure 
I). Consider a binary hypothesis testing problem in which the 
true hypothesis H is Hi with prior probability tt^ G (0,1), 
for i = 0,1. Conditioned on H = Hi, each agent k in 
the network makes an observation Yk, defined on a common 
measurable space (V, -A), and with distribution Pi^k belonging 
to an uncertainty class 

'Pi,k = {Q I Q = (1 ~ ^i,k)P* + ei,kR, R s TZ} , 

where TZ is the set of all probability measures on (V, -4), P* G 
TZ is the nominal probability distribution, and G [0,1) is a 
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Fig. 1. Hypothesis testing in a tandem network. 


positive constant that is sufficiently small so that and Vi^k 
are disjoint. We assume that all distributions in Po.fc and Vi^k 
are absolutely continuous with respect to one another, and the 
distribution Pj^k from which the observation Yk is drawn from 
is unknown. Furthermore, we assume that conditioned on the 
true hypothesis, the observations of each agent are independent 
from one another. The parameter ^ is also known as the 
contamination for the uncertainty class Pi^k- When ^ = 0, 
we recover the classical Bayesian hypothesis testing problem. 
While agents can have different contamination values eo,fe and 
ei fc, we assume that the nominal distributions Pq and Pj* are 
identical for every agent. 

For k = 1,... ,N, each agent k makes a decision Uk = 
4>k{Yk, Uk-i) G {0,1} about the hypothesis iJ, where (j)k is an 
agent decision rule whose decision i corresponds to deciding 
in favor of Hi, and Uq = 0. For z = 0,1, let = P^p x 
Pi ^2 X ... X Pi^N- Similarly, define = Vi,i x x ... x 
Pi jv- In the decentralized detection problem, our aim is to 
find a sequence of decision rules to 

minimize the maximum probability of error given by 

pDD 

sup 

where 

= 7ToPf,n{^^^\P^^^) ( 2 ) 

In (2), Pp AT and Pm.n are the false alarm and missed 
detection probabilities of agent N respectively, given the 
decision rules and the agents’ observation distributions 
z = 0,l. 

In the social learning problem, the first agent chooses its 
decision rule (pi to minimize 

sup Pe,i((^i,Po,i,Pi,i). 

(Po,i,Pi,i) GPo,ixPi,i 

Each other agent fc, given the decision rules of the 

previous agents 1,..., fc — 1, is able to derive the false alarm 
and miss detection probabilities of agent fc — 1. It then seeks 
to find pk to minimize 

Pf I <^('=-1)) 

sup P,,k{P^’^\P^^\p[^^). 



In contrast to the decentralized detection problem in (1), 
each agent myopically seeks to minimize its local maximum 
probability of error. 

For each agent fc, let pi^k be the density (with respect to 
some common measure) of Pi^k, and p* be the density of P*, 
for i = 0,1. The least favorable distributions (LFDs) for two 
given uncertainty classes Vo^k and Pi ^ is defined in [29] to be 
the pair of distributions {Qo^k, Qi,k) with densities (go,fei 9i,fe) 
such that 

( 1 f (1 - <^o,k)Po{y) for Pi{y)/Poiy) < c" 

^ I (1 - eQ^k)p*i(,y)/c" for p*i{y)/poiy) > c" 


= ] (f “ ei.fc)Fi(2/) for P*i{y)/Po{y) > c' 
\c'(l - ei,k)Poiy) for P*iiy)/Poiy) < c\ 

where 0 < c' < c" < cxd are determined such that go,fe and 
qi^k are probability densities. Note that c' = 0 if and only 
if ei,fc = 0, and c" = cxd if and only if eo,fc = 0. Let bk = 
(1 - ei.fc)/(l - eo.fc)- We then have 


qi,k{y) 

Qo,k{y) 


bkc' 

pl{y)/pl{y) < c' 


b, . 

^ p*o(y) 

r„.. J ^ Pi{y) ^ „// 

(4) 

bkc" 

fox p*i{y)/pl{y) > c". 



In [29], it was shown that the LFDs of a pair of uncertainty 
classes are the two distributions that give the largest error 
probability when using a likelihood-ratio test to tell them apart. 

In the rest of this paper, for any random variable Y with 
distributions drawn from a given pair of uncertainty classes, 
we let 1*{Y) be the likelihood ratio qi{Y)/qo(Y), where 
go and gi are the respective densities of the LFDs of the 
given uncertainty classes. In addition, we use 1*{Y = y) to 
denote the realization of 1*{Y) when Y = y. A sequence 
xi,X 2 , ■ ■ ■ ,Xn is denoted as {xi)2=i- 


III. Robust Learning in a Tandem Network 

When there is only a single agent, the minimax error 
inf^ Pi^{p) is achieved by setting ^ to be a likelihood ratio 
test using the LLDs (Qo.ijQi.i) [29]. A similar result is 
proven in [31] for a parallel network configuration. In this 
section, we show the same result for the tandem network. We 
do this by first showing that for any sequence of agent decision 
rules that consists of likelihood ratio tests between LLDs, the 
error probabilities (1) and (3) are maximized when all the 
private agent observations are drawn from the corresponding 
LLDs. We then show that when agents’ private observations 
are drawn from the respective LLDs, then likelihood ratio tests 
using LLDs minimize the error probabilities. We first state the 
following lemma proven in [29]. 

Lemma 1. Suppose that the LFDs for (Pq, Pi) are {Qo, Qi)- 
Then, for any random variable Y with distributions Pi € Vi, 
where z = 0,1, we have 

Po{l*{Y)>t)<Qo{r{Y)>t) 

<Qiil*{Y)>t) 

<Pi{l*iY)>t). 
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We can now present our first result. For all k > 1, let 
(Qo.fc, Qi.fe) be the LFDs for {Vo,k,Vi.k), and x 

Qt ,2 X ■■■ X Qi^n for i = 0,1. 

Theorem 1. Let be any sequence of monotone likelihood 
ratio tests based on the LFDs for a tandem 

network. Then for all € Vq x 'P[^\ we have 

> PF,N{(t>^^\Pi,^^), (5) 


From Lemma 3, the product of l*{Yk) and l*{Uk-i) is 

(k) 

stochastically larger under Qq than under any other distribu- 

(A;') 

tion Pq ’ as well. Therefore, we have 

Qf(fJk = 1 ) = Qf{l*{lJk-i,Yk) > tk) 

>P^’"\l*{Uk-uYk)>tk) 

= Po{Uk = 1). 


and 

PM,N{4>^''\Q[''^)>PM,N{f^^\P['''^)- (6) 

Proof: We proceed by mathematical induction on N. 
From Lemma 1, the theorem holds for N = 1. We now assume 
that it holds for N < i. We also make use of the following 
two lemmas, the first of which is proved in Appendix A-A, 
while the second is shown in [31]. 

Lemma 2. For any N, Q[^\Un = 1) > Qq^\Un = 1) and 
Q[^\Un = 0) < Qi^\UN = 0). 

Lemma 3. Let Zi and Z 2 be non-negative, independent 
random variables. If for k = 1,2, we have 

F{Zk>t) >G{Zk>t), Vf>0, (7) 

then 

F{ZiZ 2 >t) > G{ZiZ 2 >t), Vf > 0. 

If (7) holds, we say that Zk is stochastically larger under F 
than G. Since the observation of agent k is independent from 
the decision it receives, for i = 0,1 we have 


The proof for the missed detection probability inequality (6) 
is similar, and the induction is complete. The theorem is now 
proved. ■ 

Theorem 2. Let be an optimal sequence of decision 

rules such that 


4 ^^ = argIninPe,^r( 4 ^^Q 4 ^Cl^^)• 


Then, minimizes P^^f) in (1). Similarly, for each k>l, 
define ipit^k recursively as 

i’k 


where '0*^0 is ignored. Then, minimizes P^^{- \ 
in (3) for all k >1. 

Proof: In [23], it was shown that is a sequence of 
likelihood ratio tests based on Hence, for any 

sequence of decision rules 4>^^\ we have from Theorem 1, 


Qf\l*{Yk)>t) = Q,^k{l*{Yk)>t). 

From Lemma 1, l*{Yk) is stochastically larger under than 

(k) h 

under any other distribution Pq G Pq . To show the same for 
l*{Uk-i), we obtain from Lemma 2 that 


l*{Uk-i = 1) 


Qi’^-^\Uk-i 


> 1 


1 ) 

1 ) 


^ l - Q[^-^\Uk-i = 1) 

~ I - Ql!^~^\Uk-i = 1) 

> l*{Uk-i = 0). 

For any l*{Uk-i = 0) < f < l*{Uk-i = 1), 

Q\^\l*{Uk-i) >t) = Qi^\Uk-i = 1) 

> Po('=)(Pfe_i = 1) 

= Po(")(r(Pfe_i) >f), 


where the inequality follows from the induction hypothesis. 
Note that l*{Uk-i) only takes the two values l*(Uk-i = 0) 
and l*{Uk-i = 1), so for any t not in between these values, 
the equality Q^^\l*{Uk-i) > t) = P^^\l*iUk-i) > t) 
is trivially true. Therefore, l*{Uk-i) is stochastically larger 
under . 


sup Pe,Nifi^\P^^\Pi''^) 

<Pe,iv(4'^\Q[,'^\g4^) (8) 

< sup Pe,N{(l)^^\Po^\P^^^) 

(p^^\pr)epr^T^r 

where (8) follows from the theorem assumption. Therefore, 
the minimax error in the decentralized detection problem is 
equal to the minimum error when all the distributions of the 
private observations are equal to the LFDs. A similar argument 
holds for the social learning problem in the second part of this 
theorem. The proof is now complete. ■ 

We remark that Theorem 2 can be extended to general tree 
topologies. This is because in such a topology, the decisions 
received by each agent are mutually independent. Hence, 

m 

l*{Yk,Uk„. .■,UkJ = l*{Yk)l[ l*{Uk,), 

i=i 

where ki,... ,km are the agents that agent k is receiving 
decisions from. The rest of the proof then proceeds similarly 
as that of Theorems 1 and 2. 
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IV. Asymptotic Detection and Social Learning 

Theorem 2 shows that there is no loss in optimality in 
both the decentralized detection and social learning problems 
if agents in a tandem network are restricted to monotone 
likelihood ratio tests based on the LFDs. It however does not 
tell us the minimum minimax error probability achievable. 
In this section, we study the minimax error probability in 
long tandems under various technical assumptions in order to 
simplify the problem. In particular, we investigate the condi¬ 
tions under which the minimax error probability converges to 
zero as the number of agents increases.* We first consider the 
case where every agent has identical uncertainty classes, and 
provide necessary and sufficient conditions for the minimax 
error probability to approach zero under both decentralized 
detection and social learning. We will then proceed to analyze 
social learning in long tandems where the contamination of the 
uncertainty class can differ. Finally, we study the achievable 
asymptotic minimax error probability when agents do not 
know their own positions in the tandem. 

A. Identical Uncertainty Classes 

In this subsection, we make the following assumption that 
the uncertainty classes of every agent are identical. 

Assumption 1. For all k > 1, we have eo,fc = eo, ei,fc = 
ei, and the LFDs of each asent’s uncertainty classes are 
(Qo,Qi). 

The following two results give necessary and sufficient 
conditions for the minimax error probability to approach 
zero under the decentralized detection and social learning 
frameworks, respectively. 

Theorem 3 (Decentralized detection). Suppose that Assump¬ 
tion 1 holds, and that the decision rules for every agent 
are chosen so as to minimize in (1). Then 

0 as N ^ oo if and only if at least one of 
the following is true: 

7) eo = 0 and the log-likelihood ratio of Pj* versus Pq is 
unbounded from above, 

2) Cl =0 and the log-likelihood ratio of Pj* versus Pq is 
unbounded from below. 

Proof: See Appendix A-B. ■ 

Theorem 4 (Social learning). Suppose that Assumption 1 
holds, and that the decision rule for each agent is chosen 
sequentially so as to minimize P^^{fk \ in (3). Then 

Pt(h I ^ 0 as k —y oo if and only if cq = ci = 0 

and the log-likelihood ratio of P* versus Pq is unbounded. 

Proof: It was demonstrated in Proposition 3 of [6] that 
using social learning decision rules, when eo = ei = 0, the 
error probability in a tandem network where all agents have 
the same observation distributions will converge to zero if and 
only if the log-likelihood ratios between the two probability 
distributions is unbounded from both above and below. The 

*In the case where the agents’ contamination values for their uncertainty 
classes are zero, [9] calls this asymptotic learning. 


sufficiency of the given condition then follows immediately. 
To show that it is also necessary, observe that if eg > 0 or ei > 
0, then the log-likelihood ratio of Qi versus Qo is bounded 
from either above or below respectively, and Theorem 2 and 
Proposition 3 of [6] implies that is bounded away 

from zero. The proof is now complete. ■ 

From the theorems above, it can be seen that for asymptotic 
learning to occur in the social learning case, it is necessary that 
both the distributions be uncontaminated. In the decentralized 
detection case, it is only necessary that one of the distributions 
be uncontaminated. 


B. Varying uncertainty classes for social learning 

In this subsection, we relax the assumption of identical 
uncertainty classes for all agents in the previous subsection, 
and study the effect of varying contamination values on the 
asymptotic error probability in the social learning framework. 
We make the following assumptions. 

Assumption 2. Bfe have 

(i) the log-likelihood ratio of P* versus Pq is unbounded; 
and 

(ii) each agent k > 1 knows its own contamination values 
ei,k, for i = 0,1, as well as those of its predecessors, 
and its position in the tandem network. 


Assumption 2(i) is necessary because otherwise learning 
the true hypothesis is not possible under a social learning 
framework even if all the contamination values are zero, 
as shown in Theorem 4. We will show that under these 
assumptions, learning the true hypothesis happens if there exist 
infinite subsequences (eo,fc„)n>i and {eij.^)n>i (which may 
potentially be distinct) that converge to zero as n increases. 

We first observe that under the social learning framework, 
agents minimize their local maximum error probability. This 
implies that regardless of the values of eo,fc and ei fc, the 
minimax error probability of each agent k is non-increasing in 
k. This is because any agent can simply pass on the decision of 
the previous agent if no other decision rule leads to a decrease 
in minimax error probability. 

For ease of notation, we let QF,k = PF,ki4>^^\Q^'^) and 
Qivr.k = PM,k{(l>^^\Q^a^) be the LFD false alarm and miss 
detection probabilities of agent k respectively, where is 
the sequence of optimal social learning decision rules that 
minimizes P^^ifk \ in (3). From [6], it can be shown 

that the decision rule used by agent k is of the form 


Uk = 


I ^ '^oQF,k-l 

if l*(Yk) > 

V KJ _ TViQu.k-l 


if l*{Yk) 


(9) 


Theorem 5. Suppose that Assumption 2 holds, and that each 
agent k in a tandem network chooses decision rule fk to 
minimize Pl^{(j)k I in (3). Then P^^ifk \ ^ 0 

as k ^ oo if and only if there exist infinite subsequences 
eo,fc„ 0 and eij„ —>■ 0 as n —>■ oo. 

Proof: See Appendix A-C. ■ 
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We observe that agents in a tandem network in the social 
learning framework have total error probabilities at least that of 
agents adopting decentralized detection rules. Hence, Theorem 
5 also provides a sufficient condition for the minimax error 
probability (1) under a decentralized detection framework to 
converge to zero. 

C. Unknown agent positions 

In a social network, users have to make their decisions not 
knowing how many hops information has been propagated 
from a source node. We model this in a tandem network by 
assuming that each agent has no knowledge of its position in 
the network. We make the following assumption, in addition 
to Assumption 1, in this subsection. 

Assumption 3. Every agent k > 1 uses the same decision 
rule. 


deterministic decision rules may yield a lower asymptotic 
maximum error probability than either of the two deterministic 
rules. Thus, we introduce the randomized likelihood ratio test; 


Uk 


'o if r{Yk) < h 

A if i*(rk) = h 

< Uk-i if h < l*{Yk) < to 
B if l*{Yk)=to 
lif l*{Yk)>to, 


( 11 ) 


where A is equal to 0 with probability p and equal to Uk-i 
with probability 1 — p, B is equal to Uk-i with probability 
q and equal to 1 with probability q, and ti, to, p and q are 
constants to be determined. 

We will now show how to obtain the minimax asymptotic 
error probability. To do so, we start with the following two 
lemmas. 


Except for agent 1 (which knows its position in the tandem 
because it does not receive any preceding messages), every 
other agent in the tandem does not know its own position, and 
has access to exactly the same information when it comes to 
choosing a decision rule, which is based solely on the nominal 
distributions, Pg and P*, as well as the contamination values 
eo and ei. Therefore, it is natural to make Assumption 3. 

Because of Assumption 3, any sequence of k agent decision 
rules has the form = (^i, where (pi is the decision 

rule used by the first agent, and p is the decision rule used 
by every other agent with = {<p,...,(/)) consisting of 

k — 1 copies of p. For simplicity, and by abusing notation, we 
replace in our notations by {pi , p) in the sequel. 

In the following, we consider two different scenarios. 

1) Minimizing asymptotic error: We consider the case 
where agents are collaborating to minimize the asymptotic 
error. This might occur when there is a chain of agents trying 
to relay some information to a fusion center, but each agent is 
unsure of how many other agents there are between itself and 
the fusion center. For a given decision rule p, the asymptotic 
maximum error probability is given by 

PZ^{P)= lim P^°{Pi,P), (10) 

K—yOO 

where we have implicitly assumed that P™ does not depend 
on pi. This assumption is valid, as shown in the next theorem, 
which also provides a characterization for the optimal p which 
obtains the asymptotic minimax error probability, defined as 

4> 


Lemma 4. Suppose that Assumptions 1 and 3 hold. Then, 
lim P,^k{{Pi.P),QlQ\) 

k —^oo 

_ _ TroQo{P{Yi,0) = 1) _ 

" Qo(<)>(yi,0) = l)+Qo(<^(n,l) = 0) 

^iQi(<^(yi,l) = 0) 

gi(<^(yi,i) = 0) + Qi(</>(Fi,0) = i)- 

Proof: We have the two following recurrence relations: 

PF,k{{Pl.P).Ql) 

= Qo{Uk = l) 

= Ql-\Uk-i = 0) • Qo{Uk = 1 I Uk-i = 0) 

+ Ql-\Uk-i = l)-Qo{Uk = l\ Uk-i = 0) 

= (1 - Q'^o~\Uk-i = 1)) • Qo(<^(n, 0) = 1) 

+ Q^o~\Uk-i = l)-Qo{P{YkA) = l) 

= PF,k-i{{Pi,P),Ql-^) 

• [Qo{p{YiA) = 1) - Qo{P{YuG) = 1)] (12) 

+ Qo{P{Yi,Q) = l), 


PM,k{{Pl,P),Qo) 

= QUUk = 0) 

= Q’{-\Uk-i =0)-Qi{Uk=0\ Uk-i = 0) 

+ Q^i-\Uk-i = 1) • Qi{Uk = 0 I Uk-i = 1) 
= = 0) • Qi{P{Yk,0) = 0) 

+ (1 - Qt^Uk-i = 0 )) • QoiPiYk, 1 ) = 0 ) 


In the previous sections, we had no need to consider ran¬ 
domized decision rules. This is because under the decentral¬ 
ized detection framework, the final error probability under a 
randomized sequence of decision rules is no less than the 
minimum final error probability of each of the respective 
deterministic sequences of decision rules. Similarly, under 
the social learning framework, the error probability of an 
agent using a randomized decision rule is no less than the 
minimum error probability under each of the deterministic 
rules. However, this property does not hold for the asymptotic 
maximum error probability. A randomized version of two 


= PM,k-liiPl,P),Q^l-^) 

• IQiiPiYiA) = 1) - Qi{PiYi,0) = 1)] (13) 

+ gi(<^(ri,0) = 0), 


The first recurrence relation converges linearly to 

_go(^(yi,o) = i)_ 

go(</.(yi,o) = i) + go(</>(Yi,i) = o)’ 
and the second recurrence relation converges linearly to 
T:iQi{p{Yi,l) = 0) 

gi(</.(yi,i) = o) + gi(</.(ri,o) = i)- 


(14) 


( 15 ) 
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Hence, the proof is complete. ■ 

Next, we will show that if the observation distributions are 
all drawn from the LFDs, then the decision rule that minimizes 
that asymptotic error probability is a randomized likelihood 
ratio test between Qo and Qi. 

Lemma 5. Suppose that Assumptions 1 and 3 hold. For any 
(j>i, let 

(/)* =argmin lim 

(p k—¥oo 

Then there is no loss in optimality if is restricted to be a 
randomized likelihood ratio test between Qo o.nd Qi. 

Proof: Assume that is not a randomized likelihood 
ratio test between Qo and Qi. For any agent k > 2, consider 
a randomized likelihood ratio test f in the form presented in 
(11), where tQ,ti,p,q are chosen such that 

Qoif'iYk,0) = 1 ) = Qo{l*{Yk) > to) 

= QoiMYk,0) = l) 

Qiif'iYk,l) = 0) = Qi{l*{Yk) <h) 

= Qi{MYk,l) = 0). 

From the Neyman-Pearson lemma, we then have 

Qi{f'iYk,0)=0) = Qi{l*{Yk) <to) 

<Qi{MYk,0)=0) 

Qo{f'{Yk,l) = l)=Qo{l*{Yk)<h) 

<Qo{MYk,l) = l)- 

Hence, from (12) and (13), it is clear that for any fi and any 
fc > 1, we have 

Ppk{{4>i,f*),Qo,Q\) > Qo, Qi), 

and 

PMA{4>i,<k*),Qo,Q\) > PM.kii^i,^'),Qo,Qi), 

and hence 

PeA{4>i,<k*),Qo,Q\) > PeM4>i,4>'),Qo,Qi)- 

By choosing suitable values of tQ,ti,p,q, we can set 
Qo((^(yfc, 0) = 1) and Qo{f{Yk, 1) = 1) to any value between 
0 and 1. Similarly, we can also set Qi{(j){Yk,l) =0) and 
Qi{(j){Yk, 0) = 0) to any value between 0 and 1. Hence, 

min lim Pe,k{{4>i,f),Qo,Qi) 

4> k —f(X> 

is attainable, and the decision rule used to attain it can be 
assumed to be in the form of a randomized likelihood ratio 
test. ■ 

The proof of Lemma 5 shows that the asymptotic minimax 
error probability is attainable even when the likelihood ratio 
of Yk is not continuous. To avoid cumbersome notation, for 
the rest of the paper, we will assume that the likelihood ratio 
of Yk is continuous. It is easy to extend our results if this is 
not the case. 

We can now prove the following theorem, which provides 
an expression for the minimax asymptotic error. 


Theorem 6. Suppose that Assumptions 1 and 3 hold, and let 
(/)* be the randomized likelihood ratio test such that 

(^»=argmin lim Pe,k{{fi,(l)),Qo,Qi)- 

(f) k—foo 

Then, 

miP^^{4>) = lim P,,k{{fiAQ,QlQ\). 

(p k—^oo 

Proof: From Theorem 1, we have 

lim P,,k{{fi,f*),Qo,Q\) 

k —foo 

= lim sup Pe,k{{fi,f*),Po’'\p[^^) 

= P^{4>Q- 

By the theorem assumption, for any decision rule f we have 

lim Pe,kii(l)i, <!)*), Qo, Qi) < lim Pe,k{{(l>i,4'),Qo,Qi)- 

k —foe k—foo 

Hence, for any decision rule f, we have 

P !^(</') = lim sup Pe,k{{fi , f ), Po^'’, P^^^) 

p^''^GPoL-Pi''’ePi' 

> lim Pe,kii(l>i,(l>),Qo,Qi) 

k—foo 

> lim P,,k{{fufQ.QlQ\) 

k—foo 

= PZ^i^*)- 

The proof is now complete. ■ 

Theorem 6 states that each agent should find the decision 
rule to optimize the asymptotic minimax error as if its ob¬ 
servations were distributed according the the LFDs of the 
uncertainty class. This is consistent with our results when 
agents do know their positions (Theorem 1). However, the 
exact threshold values for </>* are difficult to compute in 
general, but can be found using numerical methods. Together 
with Lemma 4, since the uncertainty classes Vo,k and Vi,k 
are disjoint for all k. Theorem 6 shows that asymptotically 
learning the true hypothesis is impossible when agents do not 
know their own positions and also provides an expression for 
the asymptotic minimax error. 

2) Minimizing error of current agent: We now assume that 
every agent is acting to minimize its local minimax error 
probability, and that each agent past the first does not know 
which position it is in. This is true in general for most 
social networks, where it is difficult to find the root of any 
information spread. Hence, users typically would not know 
how many hops information has been propagated from its 
source. 

We find the decision rule to minimize the maximum error 
probability by allowing each agent to consider the maximum 
error for every possible position it might be in, then finding the 
decision rule that minimizes this value. Like in the previous 
subsection, we also assume that Assumptions 1 and 3 are in 
effect. More specifically, for each agent fc > 2, we wish to 
find (p that minimizes 

-PmaxW = SUpP|^((/) I 

k>2 


( 16 ) 


where (f)i is the optimal decision rule that minimizes 
We will show that for the optimal decision rule (p, the 
maximum error probability occurs either in the second position 
or at the asymptotic limit, as defined in (10). 

Theorem 7. Suppose that Assumptions 1 and 3 hold. Then 
(/)* = argmin^ is a randomized likelihood ratio test 

between the LFDs Qq and Qi, and 

= max{Pf I , 

where ts as defined in (10). 

Proof: See Appendix A-D. ■ 

Finding an analytical form for in Theorem 7 is difficult. 
However, as is known to be a randomized likelihood ratio 
test of Qo and Qi, this can be done numerically by minimizing 
max{P|^(^* I (</>*)} with respect to the thresholds 

for (ptf. 


We now present a numerical example using the exponential 
distribution. First, we fix PJ as an exponential distribution 
with mean 1 and let P* be an exponential distribution with 
mean 2. We set eg = ei = 0.01, and ttq = tti = 0.5. Then, 
the optimal decision rule for agent 1 has the form: 

0 if l*{Yi) < 1 
1 if l*{Yi) > 1. 

Note that for this case, if ti c' and to c", then it does 
not matter what p and q are. This is because Qo{l*{Yk) = x) 
and Qi(l*{Yk) = x) are both zero unless x = c' or x = c" 
when the nominal distributions are both continuous. We plot 
I against k when p = pA and ps, which are 

randomized likelihood ratio tests of the form (11). Both rules 
have ti = c' and p = 1, but pA has to = 5 and pB has 
to = 1 . 1 . 


V. Numerical Results 

In this section, we provide numerical results to illustrate part 
of our theoretical contributions in Section IV-C, which shows 
that even if an agent fc’s position is unknown, where k > 2, 
its optimal decision rule can be chosen to be a randomized 
likelihood ratio test between the LFDs Qo and Qi. This is 
true whether the agent is collaborating with others to minimize 
the asymptotic error (decentralized detection) or is trying to 
minimize its own error probability (social learning). We have 
shown this in Theorems 6 and 7 respectively. The form of this 
randomized likelihood ratio test </)* is given in (11). Now, we 
can rewrite this in terms of an optimzation problem. 

First, given the nominal distributions Pq* and P*, as well 
as the contamination values eg and ei, we can use a binary 
search to compute c' and c". From the definition of the LFDs, 
we know that the range of thresholds we have to optimize over 

Then, we 


is bounded between he' and he", where b = 


1 - eo’ 


can derive the LFDs Qo and Qi and obtain 

QoiMy^) = ^) = Qo{l*{Y) > h) 

+ (l-p)Qo(r(F)=fi), 

and 

QoiMY.O) = 1) = Qo{l*{Y) > to) 

+ {l-q)Qo{l*{Y)=to). 

Similarly, we have 

Qi(<^*(r,i) = 0) = Qi(r(y) <fi)+pQi(r(y) = fi) 

and 

QiiMY.O) = 0) = Qi(r(r) < to) + qQi{l*{Y) = to). 



Fig. 2. Comparison of decision rules. 


Figure 2 shows that the total error probability is decreasing 
over k for pA- Hence, the maximum error probability occurs 
when k = 2. For fs, the maximum occurs at the asymptotic 
limit k ^ oo. This is in line with our conclusion in Theorem 
7. 

Next, we let Pg be an exponential distribution with mean 
1 and P* be an exponential distribution with variable mean. 
We let eg = ei = 0.01. We denote 

pf = &rgrmnP^^^{(i>), 

Ip 

and 

= arg min P°° ((?!)), 

4> 


In the case where agents collaborate to minimize the 
asymptotic maximum error P^, we minimize the expression 
in Lemma 4 to obtain the optimal decision rule 4>^,. From 
Theorem 6, since (j)^, is a randomized likelihood ratio test, we 
can perform the optimization over Q, fg, p and q in (11). 

Similarly, in the case where agents minimize their local 
maximum error probability P^ax’ minimize the expression 
in Theorem 7 over ti, to, p and q. 


where Pmax(<^) P^{cj)) are defined as in (10) and (16) 
respectively. 

Figure 3 shows that as the mean of Pf increases, both 
^max('/’*^) P™(0™) decrease. This is intuitive as the 

Kullback-Leibler divergence of Pf from Pg* increases as the 
mean of P* increases. As the nominal distributions become 
easier to differentiate, the asymptotic error probability de¬ 
creases. 
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Fig. 4. Error probability versus contamination value. 

Figure 4 shows that both in¬ 

creases as eo = Cl increases. For this graph, Pq and 
are kept constant as exponential distributions with means 1 
and 2 respectively. As expected, as uncertainty increases, 
so does the asymptotic error. When the uncertainty is large 
enough, both converge towards 

0.5. Furthermore, as uncertainty increases, the gap between 

decreases. In a network with a lot 
of uncertainty, there is not much incentive in trying to get 
agents to collaborate, as agents selfishly trying to minimize 
their own error probability leads to similar error performance. 

VI. Conclusion 

We have shown that in a tandem network where agents’ 
observation distributions are not known exactly, and belong to 
uncertainty classes, the minimax error probability is obtained 
by assuming that each observation is distributed according to 
the LFDs of the uncertainty classes. In the case where agents 
know their positions in the tandem network, asymptotically 
learning the true hypothesis is in general impossible when 


the uncertainty classes have sizes bounded away from one, 
even when the log likelihood ratio of the nominal distributions 
of the uncertainty classes is unbounded. To achieve asymp¬ 
totic learning of the true hypothesis in social learning, we 
require the additional condition that the uncertainty classes’ 
contamination values decay over the tandem network. In the 
case where agents do not know their positions in the tandem, 
asymptotic learning of the true hypothesis becomes impossible 
even if contamination values are zero. We characterized the 
minimax error performance in this case, which provided a way 
to determine the optimal decision rules for the agents. 

In this work, we have restricted our attention to the tandem 
network. It would be of interest to extend some of our results 
to tree networks, and even to loopy general graphs. Another 
future research direction would be to consider the robust 
detection problem with more than two hypotheses. A possible 
approach to this problem could be to focus on the LFDs on 
each possible pair of uncertainty class. Lastly, for the problem 
where each agent does not know its position in the network, 
we could instead consider each agent having partial knowledge 
of his position in the network, and find conditions under which 
learning the true hypothesis asymptotically is possible. 

Appendix A 

Proofs of Main Results 
A. Proof of Lemma 2 

We will prove this lemma using mathematical induction on 
N. The likelihood ratio test for agent /c > 1 is of the form 

1 if r(rfe,t/fe_i) >4, 

0 otherwise, 

where tk is some chosen threshold. From Lemma 1, we have 

>Qi"\l*{Yi)>h) 

so that the lemma holds for N = 1. 

Assume that the lemma is true for N = k—1. Each agent’s 
observation is independent of the previous agent’s decision. 
Hence, we have 

and the likelihood ratio test can be rewritten in the form 

[1 iil*iYk)>tl 

Uk=<Uk-i if tl< I*(Yk) <4 
[o if tl>l*{Y,), 

where t]. = tk/l*{Uk-i = i)- We obtain 
Q[''\Uk = l) 

= QiAUk = 11 Uk-1 = = 1) 

+ QiAUk = 11 Uk-1 = 0)Qf~^\Uk-i = 0) 

= QiAl*iYk) > tl)Q['^-^\Uk-i = 1) 

+ QiAl*iYk) > 4){1 - Q[’^-^\Uk-i = 1)). 
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From the induction hypothesis, we then have 

+ QiAl*iYk) > 4){1 - Qf-^\Uk-i = 1 )) 

> QiAl*iYk) > = 1 ) 

+ QiAl*iYk) > 4){1 - Qi^-^\Uk-i = 1)) 

> QoAl*iYk) > = 1) 

+ QoAi*(Yk) > 4)4 - = 1 )) 

= qW([/, = 1 ), 

where the second inequality follows from Lemma 1. The proof 
of the lemma is now complete. 


B. Proof of Theorem 3 

We consider three separate cases, depending on whether cq 
or Cl is nonzero or not. 

Case 1: eg = ei = 0. 

This reduces to Vq and 'Pi (and hence and 

being known exactly, and Proposition 1 in [5] has shown that 
the maximum error rate is bounded above zero if and only if 
the log-likelihood of Pg* and P* is bounded from either above 
or below. 

Case 2: Either eg = 0 or ei = 0, but not both. 

In this case, one of the Pi reduces to the nominal probability 
distribution P*. Without loss of generality, let this be Pg- 
Define (Qo,Qi) = (Pg*,(5i) as the LEDs of Pg and Pi. Eor 
i = 0,1, let the probability density of P* be p*. Since eg = 0 
and ei f- 0, we have c" = oo and c' > 0. Note that for 
Pi(x)/pg(x) > c', we have /*(x) = bp\{x)/p'!^{x), where 
b = 1 — ei. Hence, if pI{x)/pq{x) is bounded from above, 
then for pI{x)/pq{x) > c' we have 


0 < be 

< l*{x) 

= 

Pt){x) 

< oo. 


Erom Theorem 1, we have 

inf sup Pe,Ar {4 ^'', T’g^^, P [^^) 

> iAPe,N{4^\QAQl)^ 

which is bounded above zero as TV —>■ cxd since log(/*(a;)) 
is bounded from both above and below (similar to case 1 ). 
Hence we will assume that the log-likelihood of Pg and Pi* 
is unbounded from above. Then, the log-likelihood of Qq and 
Qi is bounded from below but not from above as well. 

Using the scheme proposed in [ 6 ], we can show that we 
can make the maximum error arbitrarily small as the number 
of agents tends to infinity. We denote this scheme as $ 5 . The 
decision rules of $5 are as follows; 

Eor a given TV* and a threshold t, 


Ui = 



0 for l*{Yk) < t and Uk-i = 0 
1 otherwise. 

Uk = Uk-i- 


Note that $5 is a sequence of likelihood ratio tests between 
Qi and Qq. It was shown in [ 6 ] that by choosing a suitable t 
and TV*, we can get an arbitrarily small error rate if P^^^^ = 
QA To see this, consider a point on the ROC curve of an 
agent using the likelihood ratio test between Qq and Qi with 
its tangent to the ROC curve having slope t. This point is 
iQo{l*(Y) > t),Qi{l*(Y) > t)). As P* = Qi, the initial 
slope of the ROC curve is cxd, and so such a point always 
exists for any t. Erom the concavity of the ROC curve, we 
have 

Qi{l*{Y)>t) 


QoAiY) >t)< 


t 


The asymptotic miss detection probability using the decision 
rules outlined above is thus 

lim PM,Ni^s,QA = Qi{Un>‘ = 0 ) 

N —^oo 

Similarly, the asymptotic false alarm probability is 

lim Pf,n{<^s, Q^) = 1 - (1 - Qo{l*{Y) > t)f' 

N —ioo 

< 1 _ (1 _ 

Choose an arbitrary b > 0. To get the asymptotic miss 
detection probability smaller than 5, we have 


and so 


{l-Qi{l*{Y)>t)f' <S 
\og{S) 


N* > 


iog(i-Qi(p(y) >f))' 


(17) 


Similarly, to get the asymptotic false alarm probability smaller 
than 5, we have 


log(l - b) 


log(i - QiEimM) 

We now make use of the following lemma. 

Lemma A.l. For any probability distribution Qi G P, 

log(l-Q,(/*(y)>T)) ^ 

t _ Qiiippm) 

Proof: Let g(x) = log(l — x), a concave function. Eor 
0 < a; < 1 , we have g(x) < 0 . 

Eor any fixed T > 0, by Jensen’s Inequality, 

1 ,, t 1 ,, / X. 

j9{x) + -^94) < 9{j)- 


> N*. 


(18) 


As p(0) = 0, we have 


gjx) 

5(f) 


> t. 


0 for l*{Yi) < t 
1 for l*{Yi) > t. 
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Letting x = Qi{l*{Y) > t). 


C. Proof of Theorem 5 


Hence, 


iog(i-gi(r(r) > 0 ) 

iog(i _ " 


> t. 


iog(i-Q,(r(y)> 0 ) ^ 

iog(i _ OiMmo) 


From Lemma A.l, by choosing a large enough t, we have 

logjl-Q^{l*{Y)>t)) log(b) 
log(i - SiMmO) log(l-(5) 


and so 

log(l - '^) 2 _ log(^) _ 

log(l - ^ 1 - log(Qi(/*(y) > <)) ■ 

For a sufficiently large t, 

_ log('^) _ , 

1 - iog(Qi(z*(y) > f)) 

Hence we can find an integer N* that lies between the bounds 
in (17) and (18). Then, 

lim < lim 

N —>-oo (pi^) N—>-oo 

= lim sup 'XoPF,Ni^s,Po^^) 

«->00 [N) „(N) 

^0 t ^0 

+ 7 ri sup Pm,n{^5,p[^^) 

p(lV)gp(lV) 


We prove necessity by contradiction. Observe that if such 
subsequences do not exist, then either (eo,fc)5^]^ or {eo,k)^i 
is bounded above zero. Without loss of generality, assume 
that (ei_fc )^2 is bounded above zero. As shown in [29], c' as 
defined in (4) is increasing in ei fe. Therefore, c' is bounded 
above zero for all k > 1. This implies that l*{Yk) is bounded 

above zero as well. Let inf l*{Yk = y) = S > 0. 
k,y 

Assume now that + 'FiQM,k —?> 0 as fc —>■ 0. Then 

both Qp^k 0 and QM,k 0. Choose j such that for all 
k > j, Qp,k + QM,k < 1 and TToQp,k + 5TTiQM,k < Stti. We 
have from (9), 

QF,i+l 

= Qp,,-Qo,j+i [l*{Y,+i)> 

+ (1 — Qf,]) ■ Qo,j+i ^^*(^+i) > 

> Qfj ■ Qo,j+i il*iYj+i) > S) 

= Qf,3- 

By induction, Qp^k > Qfj for all k > j, a contradiction. The 
necessity proof is now complete. 

To prove sufficiency, assume that eo,fc„ 0 and ei —>■ 0. 

We first show a series of lemmas. 

Lemma A.2. Suppose that the LFDs for a pair of uncertainty 
classes {Po,Pi) are {Qo, Qi)- Then, for any random variable 
Y with distributions belonging to these uncertainty classes, 
we have for all t > 0, 

Qi{l*{Y) <t)< tQo(P(Y) <t)- *-Qo{r{Y) < t/2), 


^o(l — 

'^iQ M,j J 




^l(l QM,j^ 


= lim TrQPp^N{^s,Q‘'o^^)+ fiPm,n{^s,Q[^^) (19) 

N^oo 

< FqS + TTlb 

= ■5, 


where (19) is due to Theorem 1. Since <5 was arbitrarily chosen, 
for this case, we can make the maximum error arbitrarily small 
as long as one of the following is true: 

1 ) ei > eo = 0 and the log-likelihood ratio of Pq versus P^ 
is unbounded from above, 

2 ) eo > ei = 0 and the log-likelihood ratio of Pq versus P^ 
is unbounded from below. 

Case 3: eo > 0 and ei > 0. 

Here, c' > 0 and c" < oo and so the log-likelihood of Qi 
and Qo will be bounded. Then 


lim inf sup 






> lim inf 

N-^oo ^(N) 


which is known to be bounded above zero as W —)• oo as the 
log-likelihood of Qi and Qq is bounded from both above and 
below. 


and 

Qa{l*{Y) >t)< \Qi{l*iY) >t)- > 2f). 

Furthermore, the above two inequalities hold when 1*{Y) < t 
and l*(Y) > t are replaced by 1*{Y) < t and l*(Y) > t, 
respectively throughout. 


Proof: To show the first inequality, we observe that 
Qiil*{Y)<t)=tQo{l*{Y)<t) 

-[ {t-x)dQo{l*{Y) = x) 
Jx=0 

<tQo{l*{Y) <t) 

- r {t-x)dQo{l*{Y) = x) 
Jx^O 

<tQo(l*(Y)<t) 


' x—0 


it--)dQoil*iY)=x) 


= tQo{l*{Y) <t) - -Qoil*{Y) <t/2). 


The second inequality follows by interchanging Qq and Qi, 
and replacing f by l/t. The proof is now complete. ■ 
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Lemma A.3. Suppose that Qp^k + QM,k is bounded away 
from zero, and eo,k or ei^k 0 as k ^ oo. Then for any 
integer n, there exists some k' >n such that 

T^oQF,k' + TTiQM.fe' 

< FoQF,k'-l + FiQM,k’-l 
FlQM,k'-l 


Qi,k' l*{Yk') > 


Qo.fc' [l*{Yk>) < 


27ro(l — QF,k'-i) 
FlQM,k'-l 
FoQF,k'-l 

27ri(l — QM.fc'-i) J 

Proof: For any n, choose k' > n such that 
Qi.fc' {l*{Yk') > > 0 or 


7roQ F,fe'-i 


'^'^Q M,k'-1 

Qo,k' (l*{Yk') < ^) ) > possible as we 

assume either QF,k or QM,k is bounded away from 0, and as 
eo,fc or ei^fc —>• 0, the lower or upper bound on l*{Yk) converges 
to 0 or oo, respectively (cf. (4)). Then, we have 

FoQF,k' + FiQM,k' 


= F^Qp^k'-l ■ Qo,k' ( l*{Yk') > 


+ Fo{l — QF,k'-l) ■ Qo,k' > 

+ FiQM,k'-l ■ Ql,k' ( i*{Yk') < 


FoQF,k'-l 
7ri(l — QM,k'-l) 

7ro(l — QF.fc'-i) 


TTlQM.fc'-l 
7ro(l — QF,k>-l) 


FlQM,k'-l 


+ 7ri(l — QM,k'-i) 

■Qi,k' {l*{Yk') < 


FoQF,k'-l 


< FqQf.W-I ■ Qo,k' ( l*{Yki) > 


7ri(l — QM,k'-i), 

FoQF,k'-l 


+ FiQM,k'-l ■ Ql,k' \^*{Yk') > 


7ri(l — QM,k'-i) 
7ro(l — QF,k>-l) 


+ F\QM,k'-l ■ Ql,k' \^*{Yk') < 

+ TToQF.fc'-i • Qo,fe' (^*{Yk') < 

FoQF,k'-l 


Qo,k' I i*{Yk') < 


FlQM,k'-l 
2fo( 1 — QF.fc'-i) 
FlQM,k>-l 

Fo(l — QF,fc'-l) 
FlQM,k'-l 
FoQF,k'-l 
Fi(l — QM,k'-l) 
FoQF,k'-l 
2fi(1 — QM,k'-l) 


for some R gTZ. Since 

(1 - e.,k)P*{l*{Y) <x)<{l- e,,k)Pni*{Y) < x) 

+ e,,fci?(r(r) <x) 

< il-e,,k)P:{l*{Y) <x) + e,,k: 

the result follows immediately. ■ 

We now return to the sufficiency proof of Theorem 5. If 
QF,k + QM,k —> 0, the theorem holds trivially. Therefore 
we assume otherwise. Since P^^{(j)k \ is bounded 

and non-increasing, it converges. Suppose that fqQf.Ic + 
FiQM,k —5” C, for some C > 0. Either limsupfc^((5F,fc„) > 0 
or limsupj.^ ((5M,fc„) > 0. Without loss of generality, let 
limsupj,^ (QF.fcn) = C" > 0. Then there exists a subsequence 
of agents (fc„c)a>i such that (5F,fc„^ —>• C. Choose N such 
that QF,k„ , > %- for all a' > a. Then, from Lemma A.2 
and Lemma A.3, we have 


Pi 




{•Pk. 




< 


■ Qo,k 


I _ '^oQPkrvo.-l 

' ' 7ro(5F,fc„^-i 


< 


2fi(1 — QM,k„^-l) 




{l*(Yk) < 


4fi 

Lrom Lemma A.4, letting a —cxd on both sides of the above 
inequality, we obtain a contradiction. The proof of the theorem 
is now complete. 

D. Proof of Theorem 7 

The decision rule of agent 1, fi, is of the form 


iti = 


£0 

■TTl 


where the inequality follows from Lemma A.2. The lemma is 
now proved. ■ 

Lemma A.4. For i = 0,1, if Ci^k 0, then Qi^k converges 
in distribution to P*, where P* is the nominal distribution of 
the uncertainty class for hypothesis i. 


Oif r(Fi) < 

1 if r(Fi) > fa. 

Hence, the minimax error probability for the first agent is 
FoQoil*(Yi) > f^) + 7riQi(r(Yi) > f^), which is at 
least inf^ This is because using the trivial rule of 

ignoring one’s own private observation and simply passing on 
the decision of the previous agent, we get a constant maximum 

error probability of fo(5o(^*(Ti) > ^)+fiQi{ 1 *{Yi) > f^). 

Hence, we have 


inf supP^^if I (</>!,(/>)) < 7roQo(^*(Ti) > —) 

’t> k >2 TTl 

+ FiQi(r(yi) > ^). 

TTl 


( 20 ) 


Proof: Lrom the definition of the uncertainty classes, we We next prove the following lemma. 


have 

Q^,k e {Q I g = (1 - e,,k)P: + cz.fci?, i? e 7^}. 

Lor all a: > 0, we have 

Q,,kil*{Y) < x) = (1 - e,,k)P*{P{Y) < x) 

+ e,,fci?(r(r) <x), 


Lemma A.5. Suppose that Assumptions 1 and 3 hold. There 
is no loss of optimality if the optimal decision rule for all 
agents k > 2 is restricted to be a randomized likelihood ratio 
test between Qq and Qi. Furthermore, we have 

T’max(</'*) = SUpPe,fc((«!>l,</**),gS.gi)- 

k>2 
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Proof: The proof is similar to that of Lemma 5. From the 
proof of Lemma 5 it is shown that if is not a randomized 
likelihood ratio test between Qo and Qi, there is some decision 
rule (j)'^ which is a randomized likelihood ratio test between 
Qq and Qi such that for any fc > 2, p[^\ 

Pe.fe ((</>!,</>*), </>:), 

and so there is no loss in optimality in assuming that is a 
likelihood ratio test between Qq and Qi- 

From Theorem 1, if is a randomized likelihood ratio test 
between Qo and Qi, then for any k >2, Pq^\ P^\ we have 

Hence, we obtain 

■Pmax(</'*)= sup sup Pe^kHfl, f*), Pt\ 

k>2 Po&Vo,PieVi 

= SUpPe,k{{(l)l,f*),Qo,Ql), 

k>2 

and the lemma is proved. ■ 

We now return to the proof of Theorem 7. From (20), we 
have 

I fi) < supp|^((^ I (<^i,<^*)) 

k>2 

< noQo{l*{Yi) > ^)+7:,Q^{l*{Yi) < ^^ 1 ) 

TTi TTi 

Let 

a = PF,2{{fi,f*), Ql) - lim PF,k{{fiA*),Qo): 

k—foo 

/3 = Pm,2{{4>i,4>*):Qi) - lim PM,k{{4>i,4>*),Qi)- 

k—foo 

and define the function 

gix)= lim Pe,k{{(l)i,(l)*),Qo,Qi) 

k—foo 

+ TToa {QoiMYiA) = 1) - QoiMYi^O) = I))"”' 
+ ^1/3 {QiiMYuO) = 0) - QiiMYi,!) = 0)T~^ ■ 

From the recurrence relation in Lemma 4, we see that for 
integer values of x, this function gives Pe,x{{4>i, f*), Qo,Qi)- 
Differentiating with respect to x, we observe that g{x) has at 
most one stationary point for a: > 0. From (21), we have 
5 ( 2 ) > 5 ( 1 )- Hence, the stationary point, if any exists, must 
be a minimum. This means that for x > 2, the maximum 
value of g{x) must be either at a: = 2 or at its asymptotic 
limit. Thus, the theorem is now proven. 

References 

[1] R. Viswanathan, S. C. A. Thomopoulos, and R. Tumuluri, “Optimal 
serial distributed decision fusion” IEEE Trans. Aerosp. Electron. Syst., 
vol. 24, no. 4, pp. 366-376, 1988. 

[2] Z.-B. Tang, K. Pattipati, and D. Kleinman, “Optimization of detection 
networks. Part I. Tandem stnactures,” IEEE Trans. Syst., Man, Cybern., 
vol. 21, no. 5, pp. 1044-1059, Sep 1991. 

[3] T. M. Cover, “Hypothesis testing with finite statistics,” in Ann. Math. 
Statist., vol. 40, 1969, pp. 825-83. 

[4] M. Heilman and T. Cover, “Learning with finite memory,” Annals of 
Mathematical Statistics, vol. 41, 1970. 

[5] J. Koplowitz, “Necessary and sufficient memory size for m-hypothesis 
testing,” IEEE Trans. Inf. Theory, vol. 21, no. 1, pp. 44^6, 1975. 


[6] D. Papastavrou and M. Athans, “Distributed detection by a large team 
of sensors in tandem,” in IEEE Trans. Aerosp. Electron. Syst., vol. 28, 
1992, pp. 639-653. 

[7] P Swaszek, “leee trans. aerosp. electron, syst.” Aerospace and Electronic 
Systems, IEEE Transactions on, vol. 29, no. 1, pp. 254-260, Jan 1993. 

[8] W. P. Tay, J. N. Tsitsiklis, and M. Z. Win, “On the sub-exponential 
decay of detection error probabilities in long tandems,” IEEE Trans. 
Inf Theory, pp. 4767^771, 2008. 

[9] D. Acemoglu, M. A. Dahleh, I. Lobel, and A. Ozdaglai', “Bayesian 
learning in social networks,” Review of Economic Studies, vol. 78, no. 4, 
pp. 1201-1236, 2011. 

[10] K. Drakopoulos, A. Ozdaglai', and J. Tsitsiklis, “On learning with finite 
memory,” IEEE Trans. Inf. Theory, vol. 59, no. 10, pp. 6859-6872, 2013. 

[11] W. P Tay, “Whose opinion to follow in multihypothesis social learning? 
A lai'ge deviations perspective,” IEEE J. Sel. Topics Signal Process., Mar. 
2015, in press. 

[12] L. Smith and P. Sorensen, “Pathological outcomes of observational 
learning,” Econometrica, vol. 68, no. 2, pp. 371-398, 2000. [Online]. 
Available: http://dx.doi. 0 rg/lO.l 111/1468-0262.00113 

[13] A. V. Banerjee, “A simple model of herd behav¬ 

ior,” The Quarterly Journal of Economics, vol. 107, 
no. 3, pp. 797-817, August 1992. [Online]. Available: 

http://ideas.repec.Org/a/tpr/qjecon/vl07yl992i3p797-817.html 

[14] A. Banerjee and D. Fudenberg, “Word-of-mouth learning,” Games and 
Economic Behaviour, vol. 46, pp. 1-22, 2004. 

[15] S. Bikhchandani, D. Hirshleifer, and I. Welch, “A theory of fads, fashion, 
custom, and cultural change as informational cascades,” Journal of 
Political Economy, vol. 100, no. 5, pp. 992-1026, 1992. 

[16] Z. Zhang, A. Pezeshki, W. Moran, S. D. Howard, and E. K. Chong, 
“En'or probability bounds for binai'y relay trees,” in IEEE Trans. Inf. 
Theory, vol. 58, 2012, pp. 3548-3563. 

[17] Z. Zhang, E. Chong, A. Pezeshki, W. Moran, and S. Howard, “Learning 
in hierarchical social networks,” IEEE J. Sel. Topics in Signal Process., 
vol. 7, no. 2, pp. 305-317, April 2013. 

[18] J.-S. Lee and B. Hoh, “Sell your experiences: a market mechanism based 
incentive for participatory sensing,” in IEEE International Conference 
on Perx’asive Computing and Communications, 2010, pp. 60-68. 

[19] M. Demirbas, M. Bayir, C. Akcora, Y. Yilmaz, and H. Ferhatosmanoglu, 
“Crowd-sourced sensing and collaboration using Twitter,” in IEEE 
International Symposium on a World of Wireless Mobile and Multimedia 
Networks, 2010, pp. 1-9. 

[20] D. Wang, L. Kaplan, T. Abdelzaher, and C. Aggarwal, “On credibility 
estimation tradeoffs in assured social sensing,” IEEE J. Sel. Areas 
Commun., vol. 31, no. 6, pp. 1026-1037, 2013. 

[21] D. Wang, L. Kaplan, H. Le, and T. Abdelzaher, “On truth discovery in 
social sensing: a maximum likelihood estimation approach,” in Proceed¬ 
ings of the 11th international conference on Information Processing in 
Sensor Networks, ser. IPSN ’12. New York, NY, USA: ACM, 2012, 
pp. 233-244. 

[22] INRIX Inc., INRIX Traffic App, [Online]. Available: 
http://http://www.inrixtraffic.com/. 

[23] J. N. Tsitsiklis, “Decentralized detection,” in Advances in Statistical 
Signal Processing. JAI Press, 1993, pp. 297-344. 

[24] P Varshney, Distributed Detection and Data Fusion. New York: 
Springer-Verlag, 1997. 

[25] Z. Chair and P. Varshney, “Optimal data fusion in multiple sensor 
detection systems,” IEEE Trans. Aerosp. Electron. Syst., vol. AES-22, 
no. 1, pp. 98-101, Jan 1986. 

[26] R. Viswanathan and P. Varshney, “Distributed detection with multiple 
sensors i. fundamentals,” Proc. of the IEEE, vol. 85, no. 1, pp. 54-63, 
Jan 1997. 

[27] Y. Lin, B. Chen, and P. Varshney, “Decision fusion rules in multi-hop 
wireless sensor networks,” IEEE Trans. Aerosp. Electron. Syst., vol. 41, 
no. 2, pp. 475^88, April 2005. 

[28] Z. Zhang, E. Chong, A. Pezeshki, and W. Moran, “Hypothesis testing 
in feedforward networks with broadcast failures,” IEEE J. Sel. Topics in 
Signal Process., vol. 7, no. 5, pp. 797-810, Oct 2013. 

[29] P Huber, “A robust version of the probability ratio test,” in Ann. Math. 
Statist, vol. 36, no. 6, 1965, pp. 1753-1758. 

[30] S. Kassam and H. Poor, “Robust techniques for signal processing: A 
survey,” Proc. of the IEEE, vol. 73, no. 3, pp. 433^81, March 1985. 

[31] V. Veeravalli, T. Basar, and H. Poor, “Minimax robust decentralized 
detection,” in IEEE Trans. Inf. Theory, vol. 40, 1994, pp. 35-40. 

[32] E. Geraniotis and Y. Chau, “Robust data fusion for multisensor detection 
systems,” IEEE Trans. Inf. Theory, vol. 36, no. 6, pp. 1265-1279, Nov 
1990. 


